Model Selection

Efficient quantization

# Efficient quantization

Baidu ERNIE 4.5 0.3B PT GGUF

A quantized version based on the Baidu ERNIE-4.5-0.3B-PT model, optimized through the llama.cpp tool to reduce the model size and improve the running efficiency.

Large Language Model Supports Multiple Languages

BAAI RoboBrain2.0 7B GGUF

This is the quantization version of BAAI's RoboBrain2.0-7B model, which is quantized through llama.cpp and provides various quantization types to meet different hardware requirements.

Large Language Model

Sophosympatheia StrawberryLemonade L3 70B V1.0 GGUF

StrawberryLemonade-L3-70B-v1.0 is a quantized large language model designed to run efficiently under different hardware conditions.

Large Language Model English

Wan14bt2vfusionx Fp16 GGUF

Wan14BT2VFusionX is a text-to-video generation model that supports video generation through the ComfyUI - GGUF custom node.

Video Processing

Qwen3 0.6B GGUF

Qwen3 is the latest version of the Tongyi Qianwen series of large language models, offering a range of dense and Mixture of Experts (MoE) models. Based on large-scale training, Qwen3 has achieved breakthrough progress in reasoning capabilities, instruction following, agent functionalities, and multilingual support.

Large Language Model English

Gemma 3 27b It Qat Unsloth Bnb 4bit

Gemma 3 is a lightweight, state-of-the-art multimodal open-source model launched by Google, capable of processing text and image inputs and generating text outputs.

3b Ko Ft Research Release Q4 K M GGUF

This is a 3B-parameter language model optimized for Korean, converted to GGUF format for compatibility with llama.cpp.

Large Language Model Korean

Gemma 3 4b It Qat Q4 0 Gguf

Gemma 3 is a lightweight open-source multimodal model family launched by Google, built on the same technology as Gemini, supporting text and image inputs and generating text outputs.

Cohereforai.c4ai Command R 08 2024 GGUF

The quantized version of the Command R model released by CohereForAI, aiming to make knowledge accessible to the public.

Large Language Model

Gemma 3 4b It GGUF

Gemma 3.4B IT is a lightweight open-source large language model released by Google. Based on a parameter scale of 3.4B, it is suitable for dialogue and instruction following tasks.

Large Language Model

Granite Embedding 107m Multilingual GGUF

A quantized version of the multilingual embedding model developed by the IBM Granite team, supporting text embedding tasks in 17 languages, suitable for scenarios such as retrieval and information extraction.

Text Embedding Supports Multiple Languages

Granite 8b Code Instruct 128k GGUF

IBM Granite 8B code instruction model, supporting a context length of 128k, focusing on code generation and instruction understanding tasks.

Large Language Model

Transformers Other

Qwen2.5 Coder 3B Instruct GGUF

Based on the Qwen2.5-Coder-3B-Instruct model, quantization processing is performed, providing an efficient and convenient solution for code generation and dialogue interaction.

Large Language Model

Transformers Supports Multiple Languages

Nasiruddin15 Mistral Dolphin 2.8 Grok Instract 2 7B Slerp GGUF

This is a 7B parameter model based on the Mistral architecture, optimized through quantization, offering various GGUF quantization versions to meet different hardware requirements.

Large Language Model

featherless-ai-quants

Molmo 7B O Bnb 4bit

The 4-bit quantized version of Molmo-7B-O, significantly reducing the memory requirement and suitable for environments with limited resources.

Large Language Model

Llama 3.2 1B Instruct GGUF

The GGUF format version of Llama-3.2-1B-Instruct, providing broader support and better performance.

Large Language Model

Openchat 3.6 8b 20240522 IMat GGUF

This is a version of the openchat/openchat-3.6-8b-20240522 model after Llama.cpp imatrix quantization. It provides files of different quantization types, making it convenient for users to download and use according to their needs.

Large Language Model

Deepseek V2 Lite IMat GGUF

The GGUF quantized version of DeepSeek-V2-Lite, processed by Llama.cpp imatrix quantization, reduces storage and computing resource requirements and facilitates deployment.

Large Language Model

Deepseek V2 Chat GGUF

The GGUF quantized version of DeepSeek-V2-Chat, suitable for local deployment and operation.

Large Language Model Supports Multiple Languages

Mixtral 8x7B V0.1 Turkish GGUF

A model fine-tuned on a specific Turkish dataset, capable of accurately answering information in Turkish and providing strong support for Turkish-related text generation tasks.

Large Language Model

Transformers Supports Multiple Languages

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase